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The present paper provides exact expressions for the probabil- 
ity distributions of linear functionals of the two-parameter Poisson- 
Dirichlet process PD(a,#). We obtain distributional results yielding 
exact forms for density functions of these functionals. Moreover, sev- 
eral interesting integral identities are obtained by exploiting a cor- 
respondence between the mean of a Poisson-Dirichlet process and 
the mean of a suitable Dirichlet process. Finally, some distributional 
characterizations in terms of mixture representations are proved. The 
usefulness of the results contained in the paper is demonstrated by 
means of some illustrative examples. Indeed, our formulae are rele- 
vant to occupation time phenomena connected with Brownian mo- 
tion and more general Bessel processes, as well as to models arising 
in Bayesian nonparametric statistics. 

1. Introduction. Let (Pi)i>i, with Pi > P 2 > ■ ■ ■ > and EfcLi Pk = 1> 
denote a sequence of (random) ranked probabilities having the two-parameter 
(a, 6) Poisson-Dirichlet law, denoted by PD(a,#) for < a < 1 and 9 > 0. A 
description, as well as a thorough investigation of its properties, is provided 
in [37]; see also [31, 32] and [35]. Equivalently, letting 14, for any k > 1, 
denote independent random variables such that 14 has Beta(l — a, 6 + ka) 
distribution, the PD(a,c?) law is defined as the ranked values of the stick- 
breaking sequence Wi=Vi,W k = V k Yl^Zlil - Vj) for k>2. Interestingly, 
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PD(a,#) laws can also be obtained by manipulating random probabilities 
of the type Pi = Ji/T, where T = Ya^i Ji an d the sequence (Jj)i>i stands 
for the ranked jumps of a subordinator. If the Jj's are the ranked jumps 
of a gamma subordinator considered up to time 9 > 0, then the total mass 
T has a gamma distribution with shape 6 and scale 1, and (Pj)i>i follows 
a PD(0, 9) law. At the other extreme, letting the Jj's be the ranked jumps 
of a stable subordinator of index < a < 1, (Pj)j>i follows a PD(a,0) dis- 
tribution. For both a and positive, the PD(a,#) model arises by first 
taking the ranked jumps governed by the stable subordinator conditioned 
on their total mass T and then mixing over a power tempered stable law 
proportional to t~ s f a (t), where f a (t) denotes a density, with respect to the 
Lebesgue measure on M, of an a-stable random variable. We further recall 
that there is also the case of PD(— k, itik), where k > and m = 1,2, 
which corresponds to symmetric Dirichlet random vectors of dimension m 
and parameter k. All of these models represent natural extensions of the im- 
portant one-parameter family of Poisson-Dirichlet distributions, PD(0,#), 
which is closely connected with the Dirichlet process. 

Specifically, the corresponding PD(a,#) random probability measures are 
defined as follows. Independent of the sequence (Pj)j>i, or equivalently to 
the stick-breaking weights (Vi)j>i, let (Zi)i>\ denote a collection of indepen- 
dent and identically distributed (i.i.d.) random elements which take values 
in a Polish space X endowed with the Borel a-algebra S£ and have com- 
mon nonatomic distribution H. One can then construct a PD(a,#) class of 
random probability measures, as 

oo oo 

M-) = E%»(-)=E^fe*(-)- 

k=l k=l 

When a = 0, this is equivalent to the Dirichlet process which represents a 
cornerstone in Bayesian nonparametric statistics; see [10, 11, 12]. The law 
of P a fi may be denoted as ^ (ct e) (-\H). In particular, a random probability 
measure with distribution £P._ k .(-\H) can be represented as 

m p 

(i) =£-^0. 

i=i G 

where G = YliLi Gi an d the GVs are independent with Gamma(K, 1) distri- 
bution, which in our notation, means that a density function for Gi is of the 
form [T(K)]" 1 x K ~ 1 e~ x for any x > 0. In [33], one can find a description of 
this model as Fisher's model; see also [19] for more references. 

The choice of P a ^ for a > 0, or of P_ KjmK , has attractive features which 
make them viable models for Bayesian nonparametric analysis, as shown 
in [5, 33], and [18, 19]. However, for the case a > 0, most investigations 
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of PD(a,#) laws appear in applications related to excursion/occupation 
time phenomena, as outlined in [37, 38] and, more recently, to combinato- 
rial/probabilistic aspects of coalescent and phylogenetic processes; see [35] 
and [4] for numerous references along this line of research. 

This paper studies the laws of mean functionals of the PD(a, 6) class. We 
also briefly address the case PD(— k, run), which, as we shall show, essen- 
tially follows from the case of the Dirichlet process. In particular, for any 
nonnegative- valued function / such that P a ,e(f) is finite, we obtain explicit 
formulae for the density and the cumulative distribution function (c.d.f.) of 
linear functionals 

„ oo oo fc— 1 

(2) P a , e (f)= / f{x)P afi {dx) = Y J Pkf{Z k ) = Y J f{Z k )V k \{{l-V J ). 

X k=l k=l j=l 

As such, we extend analogous formulae for Dirichlet processes, corresponding 
to the case of a = 0, given by [6]. We do this by first resorting to the Cauchy- 
Stieltjes transforms of order 6 for P a ,e(f), as derived in [44, 45], and also 
to a transform of order 6 + 1 deduced from [21], where, in particular, 9 = 
for P a ,o(f)- We then apply an Abel- type inversion formula described in [42] 
and finally combine those results with mixture representations of P a ,e(f) 
laws. We note that the case of P a ,o(f) for general / is the most tractable, 
yielding explicit and simple expressions for the densities and c.d.f. which are 
expressed in terms of Abel transforms of H. The fact that our results have 
a strong connection to Abel transforms should not be totally surprising in 
view of the work in [14] where the laws of integrals of Bessel local times are 
investigated. 

Interest in the results we are going to display and prove might arise in 
various contexts. In the paper, we will focus on specific issues related to (i) 
properties of the paths of Bessel processes and (ii) Bayesian nonparametric 
statistical inference. As for the former, we obtain results for pairs of pa- 
rameters of the type (a, 0) and (a, a), thus achieving useful expressions for 
the distribution of the lengths of excursions of Bessel processes and Bessel 
bridges. For example, we recover the important special cases of PD(l/2,0) 
and PD(l/2, 1/2) which correspond to lengths of excursions of Brownian 
motion and Brownian bridge, respectively. As for (ii) above, knowledge of 
the probability distribution of P a ,e{f) can be useful for prior specification 
in applications where one is interested in making inference on a mean of the 
Poisson-Dirichlet process. However, apart from these two areas of research 
witch we are going to describe in more detail in the sequel, it is worth men- 
tioning other potential applications of our results. For example, in [9, 25] 
and [45], it has been shown that results on means of the Dirichlet process 
have implications and interpretations relevant to, for example, the Markov 
moment problem, continued fraction theory and exponential representations 
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of analytic functions. Since the PD(0, 9) model can be seen as the limiting 
case of the PD(a,#) distribution, as a — > 0, we expect that some aspects 
of our work may also be applicable to these areas. Two other important 
applications of the PD(0,#) process for which our results could be of some 
interest relate to random combinatorial structures (see [2] for an exhaustive 
account) and population genetics. 

The outline of the paper is as follows. In Section 2, we describe two areas 
of investigation for which our results are relevant and state an interest- 
ing distributional identity connecting PD(a,#) and PD(0,6>) means. In Sec- 
tion 3, we show how to use an inversion formula for Cauchy-Stieltjes trans- 
forms in order to determine a density function of P a fi{f), as (a, 8) varies in 
(0,1) x [0, +oo). In Section 4, these general results are applied in order to 
determine generalized arcsine laws corresponding to mean functionals of a 
PD(a,0) process: we show how to recover a well-known result and provide a 
representation for a density of / xP a fi(dx) when E[P ai e] coincides with the 
uniform distribution on (0,1). Section 5 provides exact forms for a density 
of P a ,e{f) f° r an Y choice of / which makes the random mean finite almost 
surely. In Section 6, a few distributional identities are given which prove to 
be useful in order to evaluate the distributions of means of PD(a, 1 — a) and 
of PD(a, a) processes. Finally, Section 7 describes an algorithm for exact 
simulation whose formulation is suggested by results illustrated in Sections 
2 and 6. All proofs are deferred to the Appendix. 

We conclude this introductory section by recalling a useful fact. Indeed, 

note that P a ,e(f) = J x Pag(dx), where both P a g and P* 9 are Poisson- 

Dirichlet processes with E[P Q ,e(-)] = H (') and E K,e(0] = #°/ _1 (0 =: n(-). 
This explains why, with no loss of generality, we will confine ourselves to 
considering simple random means of the type M a ^g(r]) := / xP at g(dx); see 
[41] for this line of reasoning. Moreover, the assumption of diffuseness of H 
has been made only for consistency with the typical definition of the Poisson- 
Dirichlet random probability measure. Obviously, rj might have atoms, as 
will be seen in most of our examples; in any case, our results are still valid. 
Finally, in the sequel, C v will denote the convex hull of the support of i], 
that is, C v := co(supp(?7)). 

2. Related areas of application. As already highlighted in the previous 
section, the main results achieved in the present paper find immediate ap- 
plication in two seemingly unrelated areas of research: the theory of Bessel 
processes and Bayesian nonparametric statistical inference. Below, we pro- 
vide a brief description of the connection. 

2.1. Occupation times for Bessel processes and models for phylogenetic 
trees. For functionals P 0t g(f), the generality of the space X is important as 
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it allows one to formally describe phenomena where, for instance, X denotes 
path spaces of stochastic processes. Surprisingly, for general (a, 6), very little 
is known about the laws of the simple, but important, case of P a ,e(Jc) where 
Ic is the indicator function of set C € 3£ , satisfying E[P a> e(Ic')] = H(Ic) = 
p £ (0, 1). Hence, f(Z) = Ic(Z) is a Bernoulli random variable with success 
probability p, otherwise denoted Bernoulli (p). In what follows, we will also 
let P a fi{C) stand for the random mean P a ,e(^c) and H(C) = H(Ic). Using 
the representation provided in (2), one obtains 

oo k—l 

PaAC) = Y, Y k V k]l( l - V i)' 

k=l j=l 

where (Yfe) are i.i.d. Bernoulli(p). The simple case corresponds to Po,e(C), 
which, since the Dirichlet process arises as a normalized gamma process, 
is well known to be a Beta(#p, 9(1 — p)) random variable. This is apparent 
from the fact that if we set Z a to be a gamma random variable with density 

function at x given by [T (a)]" 1 x^ 1 e~ x I K + (x) , then P 0) g(C) = Zg p / \Zg p + 
Zgp), where Zg p and Zg p are independent and p = 1 — p. The other known 
case corresponds to P a ^(C), which has the Cauchy-Stieltjes transform 

(3) E [( i + ^ (c)n- (1+zrlp+p " 



(1 + z) a p + p 
7], yielding, a 

pps'm(aTr)x a ~ 1 (1 — x) a_1 I( 0i i)(3;) 



Such a transform has been inverted in [27], yielding, as a varies in (0,1) 
the densities 



( 4 ) Q a ,o( x ) n\fx 2a +p 2 (l-x) 2a + 2ppx a (l-x) 0l cos(a%)Y 

otherwise known as generalized arcsine laws. It is worth noting that this, as 
discussed in [3, 36] and [38], also corresponds to the fraction of time spent 
positive by a skew Bessel process of dimension 2 — 2a. Following [38], let Y = 
(Yf,t> 0) denote a real- valued continuous process such that (i) the zero set 
Z of Y is the range of an a-stable subordinator and (ii) given \ Y\, the signs 
of excursions of Y away from zero are chosen independently of each other 
to be positive with probability p and negative with probability p = 1 — p. 
Examples of this kind of process are: Brownian motion (a = p= 1/2); skew 
Brownian motion (a = 1/2 and < p < 1); symmetrized Bessel process of 
dimension 2 — 2a; skew Bessel process of dimension 2 — 2a. Then, for any 
random time T which is a measurable function of \Y\, 

(5) A T = j T I {0t+oo) (Y s )ds 

denotes the time spent positive by Y up to time T. Furthermore, remarkably, 
A T /T = A t /t = Ai= A and A = P afi {C). We see that the case of a = 1/2, 
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in (4) is the density found by [24] for the fraction of time spent positive by a 
skew Brownian motion. Moreover, when p = 1/2, this coincides with Levy's 
famous result yielding the arcsine law for Brownian motion. That is, when 
p = 1/2, the random probability -Pi/2,o(C) has a Beta(l/2, 1/2) distribution; 
see [28]. 

In [38], it is also shown that the fraction of time spent positive by a 
skew Bessel bridge of dimension 2 — 2a corresponds to the law of P a ,a(C). 
This random variable also arises, among other places, in Corollary 33 of 
[34]. Another recent instance is that of P aj i_ Q (C), which equates with the 
limiting distribution of a phylogenetic tree model described in Proposition 20 
of [16]. However, results for these models are only well known for a = 1/2, 
which corresponds to skew Brownian bridges. In particular, setting p= 1/2 
yields the Levy result for Brownian bridge which implies that -Pl/2,i/2(C) is 
uniform on [0, 1]. A density for Pi/2,o(C) and general p has been obtained by 
several authors; see, for instance, equation (3.4) in [5]. The case of (1/2,0) 
when p = 1/2 is then Beta(# + 1/2, 6 + 1/2); see also equation (65) in [1] for 
the density of -Pi/2,i/2(C) for general p and yet another application related 
to the law of P QiQ: (C). 

While the cases of Bernoulli Y^'s are indeed quite interesting, we do wish 
to reiterate that it is substantially more difficult to obtain results for the 
more general case where the Y^'s have a general distribution r/. 

2.2. Bayesian nonparametric statistics. The topic of this paper can be 
naturally connected to a large body of literature in Bayesian nonparametrics 
which is aimed at investigating the probability distribution of functionals of 
random probability measures. Besides the pioneering work in [6], we men- 
tion: [7] and [30], where nonlinear functionals of the Dirichlet process are 
studied; [29, 40] and [17], which provide developments and refinements of 
the earlier results in [6]; [21] and [41], which yield distributional results for 
means of a class in random probability measures that generalize the Dirichlet 
process. The interest in random probability measures in Bayesian nonpara- 
metric statistics is motivated by the fact that they define priors on spaces 
of probability distributions. 

Here, we briefly describe the contribution in [6] since it has inspired our 
own approach. A first result contained in that paper consists of an important 
formula for the generalized Cauchy-Stieltjes transform of order 9 of the 
mean functional Po.e(f) of the Dirichlet process Pq^ with parameter measure 
9H . Supposing that / : X — > R is such that / x log(l + \f(x)\)H{dx) < 00, the 
Cifarelli and Regazzini [6] show that 



(6) 



E 



1 



e -6 J x lo S (z+f(y))H (dy) = g -0 J R lo g (z+x)r,(dx) 



\.(Z + P ,e(f)) 



e 
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for any z € C such that | arg(z)| < ir and t) = H o f 1 . The expression in 

(6) establishes that the Cauchy-Stieltjes transform of order 6 of Po,e{f) is 
equivalent to the Laplace transform of Ge(f), where Po,e{f) = Ge(f)/Gg(l) 
and Gq is a gamma process with shape OH. The importance of (6) in differ- 
ent contexts was recognized by [9, 25] and [45]. In this regard, it is called 
the Markov-Krein identity for means of Dirichlet processes. It is called the 
Cifarelli-Regazzini identity in [22]. With considerable effort, Cifarelli and 
Regazzini [6] then apply an inversion formula to (6) to obtain an expression 
for the distribution of / xPqq (dx) as follows. Let q e denote the density of 

/ xPog(dx), where Pq^q is a Dirichlet process with parameter measure 6rj, 
set ^(x) := rj((0,x}) for any x > 0, and let 

(7) R(t)= f log(\t-x\) V (dx). 

JR+\{t} 

Then, from [6] (see also [7]), one has, for 6 = 1, 

(8) q Q 1 (x) = - sm(TT^(x))e- R ^ 
and when 6 > 1 , 

(9) q oe (x) = (6-l) f X (x-t) e - 2 -sm{n6^(t))e- m ^dt. 

Jo vr 

Additionally, an expression for the c.d.f., which holds for 6^> not having 
jumps greater than or equal to 1, is given by [6] as 

(10) / X (X - t) 9 ' 1 - sm{7r6^{t))e~ 9m dt. 
Jo vr 

In particular, 10 holds for all 6 > if r] is nonatomic. We note that while there 
are various formulae to describe the densities of / xPo ) 0(dx), descriptions for 
the range < 6 < 1 prove to be difficult; see, for example, [6, 40] and [29]. 

Here, we provide a new description for the density, which holds for all 6 > 
0. This result will be obvious from our subsequent discussion concerning the 
inversion formula for the Cauchy-Stieltjes transform and otherwise follows 
immediately from (10). 

Proposition 2.1. Assume that rj admits a density on IR + and suppose 
that R defined in (7) is differentiable. Then, the density of the Dirichlet 
process mean functional J xPq $(dx) may be expressed, for all 6 > 0, as 

(11) Q oe (y) = - riy-tf^dg^dt, 

vr Jo 

where 

(12) d e , v (t) = ^sm(M(t))e- eR ^. 
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It is apparent that practical usage of these formulae require tractable 
forms for R and its derivative, which are not always obvious. 

We close the present section with an important distributional identity 
which connects P a ,e{f) with mean functionals of the Dirichlet process. For 
a measurable function / :X — > R + such that H{f a ) < oo, let Q a ,o denote the 
probability distribution of P a ,o(f)- This means that / x log(l + x)Q (dx) = 
E[log(l + -Pa,o (/))]• The last expression is finite, since J x f a (x)H(dx) < oo. 
Hence, xP^fiidx) < oo almost surely. From Theorem 4 in [45], note that, 
for any z such that \z\ <tt, 

(13) exp(-e^ + log[z + x}Q ai0 (dx)^ = U^[z + x] a r)(dx)\ ' 

From the Cifarelli-Regazzini identity (6), the left-hand side in (13) coincides 
with the generalized Stieltjes transform of order 9 of the random Dirichlet 
mean Mo i e(Q C(0 ), whereas the right-hand side is the generalized Stieltjes 
transform of order of P a ,e(f)- These arguments can be summarized in the 
following theorem, which states a distributional identity between the mean 
L+ xPofi(dx) of a Dirichlet process and a suitable linear functional of a 
PD(a,#) process. 

Theorem 2.1. Let f :X^R + be a measurable function such that H(f a ) < 
oo, where a £ (0, 1). If Q a stands for the probability distribution of P a ,o(f) , 
with P a fi such that E[P ai o( - )] = H{-), then 

(14) P a ,e(f) = I xP , e (dx), 

where Pq : q is a Dirichlet process with E[P ,6>( - )] = Q a ,a(') an d the symbol = is 
used to denote equality in distribution. Moreover, the probability distribution 
of P a ,e(f) is absolutely continuous with respect to the Lebesgue measure on 
R. 

It is worth noting that an alternative proof of (14) can be given, based 
on a construction of Gnedin and Pitman [13]. 

The last part of the statement in Theorem 2.1 — that is the absolute conti- 
nuity of the probability distribution of P a fi{f) with respect to the Lebesgue 
measure — can be deduced from the absolute continuity of the probability 
distribution of f R+ xPo i e(dx) which is proved in Proposition 2 of [29]. Hence, 
the probability distribution Q a fi of P a ,e(f) has a density with respect to the 
Lebesgue measure on R, which we denote q a ^. 
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3. The probability distribution of P a ,e(f)- The present section provides 
a general expression of the density function of the mean of a two-parameter 
Poisson-Dirichlet distribution by resorting to an inversion formula for the 
(generalized) Cauchy-Stieltjes transform of order 9 > 0. Before getting into 
the details, let us introduce some new notation that will be used henceforth. 
Let H be some nonatomic distribution on (X, SC) and let / : X — ► M + be any 
function in the set 

g a (H) :={f:X^R + s.t. / is measurable and H(f a ) < +oo}. 

Moreover, as anticipated in the Introduction, P a ^ denotes a random prob- 
ability measure with law ^, 9 A-\H). We confine our attention to func- 
tions in (o a (H) for two reasons. First, the integrability condition H{f a ) = 
j^f a {x)H(dx) < +oo is necessary and sufficient for the (almost sure) finite- 
ness of P a ,o(f)> see Proposition 1 in [41] for a proof of this result. Hence, one 
can use the absolute continuity of £P (a e) (-\H) with respect to & > , A-\H') 

in order to obtain P a ,o(f) < oo with probability 1. Second, we consider only 
nonnegative functions since the inversion formula we resort to applies to 
functionals P a fi{f) for a measurable function /:X— 

Given a function / in £ a {H), the transform of order 9 > of P a ,e(f) i s ) 
for any z £ C such that | arg(z)| < ir, 

-6/a 



(15) Sg[z;P at g(f)]=E 



1 



[z + f(x)] a H(dx) 



.(z + P a , e (f)) 6 

Such a representation is to be attributed to [26] and also appears, with 
different proofs, in [44] and [45]. This transform turns out to work well in 
the case where 6 > 1. Additionally, we will need the transform of order 9 + 1, 
that is, 

(16) S e +i[z;Pa,eU)\ ~ 



{f x [z + f(x)]<*H(dx)y/°+i- 

In particular, for 9 = 0, we have, importantly, the Cauchy-Stieltjes transform 
of order 1 of the PD(a,0) mean functionals, 

(ir) s l[z ,p a M}- k[z + m]aH{dx) ■ 

The transforms (16) and (17) can be obtained as special cases of Propo- 
sition 6.2 in [21] with n = 1. Moreover, for 9 > 0, (16) can be obtained 
by taking a derivative of (15). When inverting (15) or (16), one obtains 
the probability distribution of P a ,e{f)- Since Theorem 2.1 implies that the 
probability distribution of P a ,e(f) is, for any (a, 9) G (0, 1) x IR + , absolutely 
continuous with respect to the Lebesgue measure on K, we obtain the density 
function, q a of P a ,$(f). 
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The particular inversion formula we are going to use has recently been 
given in [42] ; see also [43] . In [8] , one can find a detailed account of references 
on inversion formulae for generalized Cauchy-Stieltjes transforms. 

Theorem 3.1. Let f be a function in £ a (H) and define 

(18) A a>e (t) := lim{S fl [-t - ie; P a ,etf)] - S e [-t + ie; P a ,« (/)]}• 

Z1T1 ej.0 

The density function q a g ofP a $(f), evaluated at a point y inC v = co(supp(r/)) ; 
then coincides with 

(19) ?„»= j\y-t) e ~ l A ! afi {t)dt. 

Jo 

When 6 > 1, the expression above can be rewritten as 

(20) g a g (y) = (9-1) \\y - t) e ~ 2 A afi {t) dt. 

Jo 

It is worth noting that if 9 = 1, then q a ^i = A a x, thus yielding the same 
result as in [46] . The case corresponding to 9 < 1 can also be dealt with by 
computing the transform Sq+i[z; P a #(/)]. One then obtains 

(21) q aB (y) = 9 [\y - tf-^e+i® dt, 

Jo 

where 

*a,8+m ■ 2n . _ . £ + f{x))aH{dx)]e/a+ i 

(22) 

f x [-t + is + f{x)r^H{dx) 



[k{ _ t + i£ + f{x)rH{dx)] e/ a+ i 
Note that the formulas (19) and (21) lead to the almost everywhere equality 

(23) A' a = 6A afi+l 

for 6 > 0. Finally, note that A Qi i is, by Widder's inversion, the density of 
P a ,o(f)- Hence, a first approach for the determination of the distribution of 
Pa d{f) w iH a i m & t the determination of A a g and A a g + i. This task will be 
completed in the following sections. Note that once we obtain an explicit 
form for A a ^, the c.d.f. of P a ,e(f) is given by 

(24) Q M-oo,x])= [ X qJy)dy= [* (x — y) e ~ 1 A a>e (y) dy 



for all 9 > 0. This result follows by using the representation in (19) and 
applying integration by parts. As we shall see, this representation plays a 
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key role in obtaining simpler expressions and various identities for Aq,#, 
hence simplifying the formulas for the densities. 

Finally, we anticipate some notation that will be useful. First, consider 

poo pt 

^Jdit) =] ( x ~ t) d v{dx) and j^ 4 (t) = J (t - x) d n(dx), 
which represent generalized Abel transforms of the measure r/. Now, define 
7 Q (t) = cos(a7r).0^ ja (t) + srf£ a (t), 

C a {t) =8in(at7r)&C,a(t), Pa,e(t) = - arc tan + — Ir«(i), 

a 7a{t) a 

where T a := {t G R + : y a (t) < 0}. Clearly, when a < 1/2, y a (t) > for all t. 

4. Generalized arcsine laws. We first deal with linear functionals of the 
PD(a,0) process. As such, we generalize the results of Lamperti [27] for 
the case of P a ,o(C). We also point out that Regazzini, Lijoi and Priinster 
[41] obtain an expression for the c.d.f. of M a fi{rj) by exploiting a suitable 
inversion formula for characteristic functions and, additionally, provides ex- 
pressions for its posterior density. Here, the approach we exploit leads to 
explicit and quite tractable expressions for the density which is expressed in 
terms of Abel transforms of 77. Moreover, we also derive new expressions for 
the c.d.f. which can indeed be seen as generalized arcsine laws. 

Theorem 4.1. Let 7/ be a probability measure on (X, 3£) with X C M + 
and set Q a>v := {y£R + : f % \y - t\ a - l n{dt) < +00} nC v . If J t a n(dt) < +00 
and the Lebesgue measure of is zero, then a density function of the 
random variable M Qj o(r/) = / xP a o(dx), denoted by q a0 , coincides with 

(25) q (y) - sin ( a7r ) ^la^)^-!^) + ^a-M^vAv) 

71 Kta(y)] 2 + 2cos(avr)j4+ {y)^ a {y) + K, a (y)] 2 

for any y G 9 afl . 

The proof is provided in the Appendix. The result for the form of the 
density is new. We are also able to obtain, in view of obvious difficulties 
with direct integration, a rather remarkable expression of the c.d.f., given in 
the next theorem. 

Theorem 4.2. Let n be a probability measure on ]R + such that f x a r](dx) 
is finite and the Lebesgue measure of the set Q c a ^ is zero. If 1 1— ► ^?(t) = 
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T}((— oo,t]) is Lipschitz of order 1 at any y G Qir) , then the c.d.f. of M a fi(ri) 
is given by 

1 . fCa(x) 



(26) Q a0 ((-oo,x]) = — arctanf 

air \ 

for any x in C v and a in (0, 1). 



7a 0*0 



Applying Theorem 4.2 to the case a = 1/2, we obtain the following result. 

Corollary 4.1. Consider the setting as in Theorems 4-1 o,nd 4- 2. The 
density of the random variable M 1 / 2 ,o( r /) * s given by 

, _, f v i<i /2 (y)<-i/2(y) + <ii/2(y)<i/2(y) 

( 27 ) 9 1/2 , (y) = - — - — — 2 — r ^ 2 

[<i /2 (y)] +Ki/2(y)] 

/or any y G & a ri an d its c.d.f. is given by the generalized arcsine distribution 

2 

(28) Q 1/2)0 ((-oo,x]) = -arcsin(7rA 1/2il/2 ( 2; )[7 1 2 /2 (x) + Ci 2 /2 (^)] 1/2 ). 

Remark 4.1. As we see, the results for the PD(a,0) are tractable and, 
quite remarkably, only require the calculation of the Abel transforms sd^ a. 
and £&^ a . In this regard, one can, in general, obtain explicit results much 
more easily than for the case of the Dirichlet process. It is worth pointing 
out once again that our expressions for the c.d.f. show that these models 
have indeed generalized arcsine laws. These expression are rather surprising 
as it is not obvious how to integrate with respect to the densities. 

Below, we illustrate a couple of applications of Theorem 4.1. The first 
example recovers a well-known result given in [27], while the second provides 
an expression for the density q a when the parameter measure rj coincides 
with the uniform distribution on the interval [0, 1] . 

Example 4.1 (Lamperti's occupation time density). Here, as a quick 
check of our results, we first revisit Lamperti's model. That is to say, the 
distribution of P a ,o(C). This corresponds to r\ being the distribution of a 
Bernoulli distribution with success probability p = E[P a> o(C)] = 1 — p. It fol- 
lows that for any d > 0, the Abel transforms for a Bernoulli random variable 
are given by 

^dit) = ( l ~ t) d P and ^d{t) = t d p 

for any t in (0,1). Hence, one easily sees that Lamperti's formula in (4) is 
recovered. In addition, we obtain the following new formula for the c.d.f.: 

1 / s\ii(i:a)x a p \ 

Q a , ((-°°> x ) = — arctan — 

air \cos{aTT)x a p + (1 — x) pj 
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for any x G (0, 1). This may also be expressed in terms of the arcsine, using 
the fact that for any t G (0, 1), 

, . . , . sm(air)t a p 

29) A aa (t) = — — » . 

' vr[t 2 V+2cos(a7r)^(l-tfpp+(l-t) 2 V] 

Example 4.2 (Uniform parameter measure). We again note that, while 
there are several techniques one could have used to derive expressions for the 
functional P a ,e(C), it is considerably more difficult to obtain results for a 
more general choice of P a ,o{f)i with / in S a {H). Here, we demonstrate how 
our results easily identify the density in the case where rj(dx) = I( ,i)( a; ) dx. 
For M a fi{rf) = J xP a ^(dx), direct calculation of the Abel transforms leads 
to the expression of its density as 

/ s = (a + l)sm(air)y a (l-y) a 

qao{V) air[y 2a + 2 + (l-y) 2a + 2 +2cos(aw)y a + 1 (l-y) a + 1 ] (°' 1 ) lyj ' 

Note that one easily finds -y a (t) = {t a+l cos(avr) + (1 - t) a+l )/(a + 1) and 
Ca(t) = t a+1 sin(a7r)/(a + 1), additionally providing an expression for the 
c.d.f. In the Dirichlet case, the distribution of f, ^ xPo t g(dx) can be deter- 
mined by means of results contained in [6] and it is explicitly displayed in 
[9]. Its density function on (0, 1) has the form 



q oe (y) = -(l-y)-^y-y S m(<Ky)I m (y). 



5. The probability distribution of PD(q, 6) random means. In the present 
section, we are going to consider more general cases than the a-stable pro- 
cess dealt with in the previous section. In particular, we illustrate the results 
one can achieve by applying an inversion formula provided in [42] in order 
to obtain the density of M a g(r]) when a £ (0, 1) and 6 > and then make a 
few remarks concerning the case in which a < 0. 

5.1. Densities for M a g(r]). As suggested by the inversion formula, the 
evaluation of a density function for the random functional M a ^ basically 
amounts to the determination of the quantities A a g and A a +1 . We then 
move on, stating the two main results of the section and providing an inter- 
esting illustration. 

Theorem 5.1. For any t G 6 afl and (a, 9) G (0,1) x E + , one has 

(30) A a , 6 (t) = ^2(^*2(^3/(2.*) MpaAt))- 
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A combination of the above result with Theorem 4.2 leads to another 
representation for the c.d.f. of the mean P a ,o(f) of an a-stable subordinated 
First, note that, for any a £ (0, 1), one has 

PaA t ) = arc tan —J7\ + Klr a (t) = arc sin + nI Ta (t) , 

where sign(a) = 1 if a > and sign(a) = — 1 if a < 0. At this point, from 

(30) , one has 

= sign( 7a (t)) Ut)ha(t)\ = Q a (t) 

If, as in the statement of Theorem 4.2, one has that VP is Lipschitz of order 
1 at any y £ ® a ,7i, an alternative representation for the distribution function 
of a normalized a-stable random mean displayed in (26), that is, 

(31) Qa,o((-°o, x\) = — arc sin(7r A a a (x) J (x) + 7^ (x) ) , 
holds true. 

Remark 5.1. As pointed out at the end of Section 3, if a £ (0,1/2], 
then p a ,e(t) = (6*/a) arc tan(Co(t)/7a(0)- Hence, if one resorts to the ex- 
pression for the c.d.f. of P a fl{f) as provided in (31) it can be noted that 
Pa,e(t) = ^^Qafid— 00, t]) and this yields a representation equivalent to the 
one displayed in (30), that is, 

sin(07rQ a)O ((-oo,t])) 



(32) A afi (t) 



A further evaluation of A a ^ can be deduced by resorting to the correspon- 
dence between PD(a,#) means and Dirichlet means, as stated in Theo- 
rem 2.1. Indeed, one can prove the following useful identities. 

Theorem 5.2. Let R a (t) = J R+ x r-logli — y\Q a>0 (dy). Then, for all 9 > 
0, the following results hold: 

(i) A a>e (t) =ir- l sm(Tr0Q aM ((-oo,t]))e- eRa{t) ; 

(ii) for any t € {x : Q afi {(— 00, x\) > 0} and a £(0,1), 

A a , a (t)vr ^ a 



e ~Ra(t) 



sin(vraQ Qi0 (t)) 



The expression of A a ^, as determined in Theorem 5.1, Remark 5.1 or 
Theorem 5.2(i), is useful when one aims to evaluate the density function of 
P a fi{f) corresponding to the case 6 > 1. When 6 < 1, one must resort to the 
expression given in (21) and then the evaluation of A a> Q+i is necessary. The 
following theorem deals with this issue. 
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Theorem 5.3. For any (a, 9) G (0, 1) x R + and t G Q a ,r], the following 
identity holds true: 

7a_i(i) sin(p Qi0 (t)) - ( a -i(t) cos(p a; e(t)) 



(33) A at e+i(t) 



We now proceed to provide a simple illustration of the above results by 
means of an example. More detailed discussion about the determination of 
the probability distribution of M at g(r]) is developed in the following section. 

Example 5.1 [First expressions for P at g(C)]. It is interesting to compare 
the general case of P a ,e{C) with that of Lamperti's result in Example 4.1. 
Here, using the specifications in that example, it follows that 

( 4) a ()= sm(f arctan( pcoB ^^_ t)a ) + f ^ffl) 

1 ' a ' e{) KipH^+piii-tya + ippcosia^ii-tyy/^)' 

where T a = if a G (0,1/2], whereas r a = (0, ^-) with v a = {-pj 

(pcos(a7r))) 1 / Q if a G (1/2,1). From (34), one can recover the expression 
for the c.d.f. of P a ^(C) by resorting to (24) if the parameter 9 > 1. When 
9 < 1, expressions for A Q ^ +1 can also be calculated explicitly, leading to 
formulae for the density. In general, it is evident that such results are not 
as amenable as the case P a fi(C), although they still lead to interesting in- 
sights. We will see that a case-by-case analysis can lead to more explicit 
expressions. We also develop other techniques in the forthcoming sections. 

5.2. The probability distribution of M_ K ^ mK (r]). In this section, we estab- 
lish the law of the mean functional of a random probability measure with 
distribution & > ,_ K , {-\H) which, according to (1), is given by 



i=l G i=l G 



where the Y^s are i.i.d. with common probability distribution n. One rea- 
son to study these functionals is that for the choice of k = 9/m, one has 
that M_Q/ m Q(rj) converges in distribution to M 0j e(r/) as m— > oo. This fact 
may be found in, for example, [20]. It is easy to see that, conditionally on 



(Yi, . . . ,Y m ), M- KmK (<q) = M 0jmK (7? m ), where 



1 m 

(35) %*(•) = -!>< 

i=i 



is the empirical distribution. Thus, descriptions of the conditional distri- 
bution of M_ KimK (?7), given iXi)i>ii follow from (8), (9) and (10) for ap- 
propriate ranges of the parameter 9 = inn, with r/ replaced by rj m . The 
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Cauchy-Stieltjes transform of M- KtTnK (rj) of order mK is 
1 



E 



(z + y) K rj(dy) 



arg(z)| < 7r. 



Xz + M_ K , mK (r)))™ 
Now, set 

where A t) m = {i ■ Vi / t} n {1, . . . , m} and suppose that c mK (t) := /( 0)OO ) I* ~ 
y\~ mK Tj(dy) < oo for almost every £ with respect to the Lebesgue measure so 
that one can define p mK (t) = J (t — y)~ mK rj(dy) /c mK (t) and 

h m ,m K {t) := [c mK (t)] m ^2sm(7TjK) ( J [p mK (t)] j [l - p mK (t)] m ~ j . 

j=l 

This leads to the following interesting result. 

Proposition 5.1. 7/"E„[-] denotes the expected value taken with respect 
to r], then: 

(i) E v [uj^(t)sm(-KmKr] m (t))} =h m)TnK (t); 

(ii) when mn = 1, the density of M_\i m ^(rj) is given by h m> i(x)/ir; 
(hi) for K = e/m<l, the c.d.f. of M_ e / mfi {r]) is 

^ f X 9 1 

Q-e/m,e({-o<=>, x]) = - (x- 1) ~ h m>& (t) dt. 
vr Jo 

Furthermore, this c.d.f. converges to (10), as m — > oo, for all > 0. 

6. Distributional recursions. In this section, we describe mixture repre- 
sentations which are deducible from the posterior distribution of PD(a,#) 
laws and existing results for the Dirichlet process. These represent aids in 
obtaining tractable forms of the distributions of various models M a ^ (rj) . In 
particular, we will use this to obtain a nice solution for all PD(a, 1 — a) 
models. Before stating the result, let us mention in advance that B a ^ stands 
for a beta-distributed random variable with parameters a, b > 0. 

Theorem 6.1. Let the random variables W, M a ^ +a (r]) and Bg +olt i_ a 
be mutually independent and such that W has distribution r/. 

(i) Then, for < a < 1 and 9 > —a, 

M afi {r]) = B e+aA _ a M afi+a (r]) + (1 - B e+a ,i- a )W. 

Note that when 6 > and a = 0, this equates with the mixture representation 
for Dirichlet process mean functionals. 



FUNCTION ALS OF POISSON DIRICHLET PROCESSES 



17 



(ii) Moreover, for any 9 > and a £ (0, 1), one has 

M^n) = B ejl M a! e(rj) + (1 - B 6>1 )M a>0 (n). 

An immediate consequence of this result is that if one has a tractable 
description of the distribution of M a fi +a {rf), then one can easily obtain a 
description of the distribution of M a g(r]). Moreover, from Theorem 6(i), 
one can deduce representations of P a ,e(f) f° r negative values of 0, once an 
expression of M a ^ +a (rj) is available. 

Remark 6.1. Recall that P\/2,q{C) for p = 1/2 has the arcsine distribu- 
tion Beta(l/2, 1/2). Applying the mixture representation in statement (ii) 
of Theorem 6.1, one can see, via properties of Beta random variables, that 
h/2,e{C) is Beta(0 + 1/2,6 + 1/2). This corresponds to a result of ([7]) for 
MQfi(rf), where n is the arcsine law, although a connection to occupation 
time formula was not made there. 

6.1. Results for PD(a, 1) and PD(a, 1 — a). We are now in a position to 
discuss some of the easiest (and also more important) cases. First, we deal 
with a PD(a, 1) mean functional, M a; i(n), and use it in order to determine, 
via a mixture representation, the probability distribution of M Qj i_ a (r;). 

We have already mentioned that when 9 = 1, the inversion formula sim- 
plifies and the density function of Mi ja (n) reduces to q a ^\ = A a)1 , as given 
in (30). For the range < a < 1/2, this further reduces to 



i \ a ( \ sin( l/« arc sin(7r A Q , a (wg (y) +jj{y))) 
(36) qaM = A a ,(y) = ^ (y) + 7 2 (y)]1/(2 «) • 

If a = 1/n, with n = 2, 3, ... , one can then use the multiple angle formula 



sin(?ix) = ^2 \k) [ cos ( :r )] fc [ s ™( x )] n k s i n (^[ n — , 

in order to simplify the expression of q a ,i- These remarks can be summarized 
as follows. 

Theorem 6.2. A density function of M aj \(rj), for all < a < 1, coin- 
cides with 

sin(l/aarctanC Q (?/)/7a(y) +7r/aIr Q (y)) 

ACi(y) + ii{y)\ 1,la 

for any y £ O a , v - When a = 1/n, with n = 2, 3, . . . , then a density function 
for Mi/ n i(rj) coincides with 
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Setting n = 2 in the last formula yields a simple form for a density function 
of M 1 /2,i(??), expressible as 

> 2 7i/2(y)Ci/2(y) 2 <i/jyKyy) 

<7l/2,lU/j " „ [C 2 /2 ( y) + 7 2 /2(y)]2 " n [C 2 /2 ( y) + 7 2 /2(y)] 2 ■ 

Now, the density of PD(q, 1 — a) mean functionals can be deduced from 
PD(a, 1) models via the mixture representation given in Theorem 6.1. 

Theorem 6.3. A density function of the random mean M Q , j i_ a (ry) is 
obtained via the distributional identity 

M a ^ a (rj) = £i,i_ a M a ,i(T;) + (1 - B 1A - a )W, 

where Bi t i- a , M a ^(rj) and W are independent. Here, W is a random vari- 
able with distribution rj. In particular, the density of M a ^_ a {r]) takes the 
form 

(1 - a) | o °° jT 1 A a>1 ) (1 " by'b-* db V (dw), 

where A aj i(t) = for any t < 0. 

Example 6.1 (A distribution relevant to phylogenetic models). Recall 
from the Introduction that the random variable P a i_ a (C), when E[P a i_ a (C)] 
1/2, is equivalent in distribution to the random variable appearing in [16]. 
It is known that when a = 1/2, the distribution is uniform, according to the 
well-known Levy result; see [28]. Here, we obtain a quite tractable repre- 
sentation of the laws for all values of a and with E[P Q , j i_ Q (C)] = p = 1 — p, 
for any p G (0, 1). To this end, we first obtain the distribution of P a ,i(C). 
This can be easily obtained by setting 9 = 1 in (34), which yields the density 
function 

, , , sin(l arctan( pcoB [^^_ t)a ) + glr a (*)) 

Ja,l,p\ x ) 



7r{p 2 t 2a + p 2 (l - t) 2a + 2ppcos(air)t a (l - i)"} 1 /^) ' 

where T a = if a G (0,1/2], whereas T a = (0, with v a = (~p/ 

(pcos(a7r))) 1 / Q when a G (1/2, 1). Since a density function of P Q) i(C) is 
available, one can evaluate the probability distribution of P a ,x-a(C) via the 
mixture representation stated in Theorem 6.1. It suffices to set 77 = b p , where 
b p (x) =p x (l — p) 1 ~ x I{o^}(x) is the probability mass functions of a Bernoulli 
random variable with parameter p. Hence, one has P a ,i{C) = M a ^(b p ) and 
1 - M Q ,i(6 p ) = 1 - P Q ,i(C) = M a>1 (b p ). 
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Corollary 6.1. Let W denote a Bernoulli random variable with pa- 
rameter p and let W be independent of B\ t i- a and M a! i(b p ). Then, condi- 
tionally on the event W = 1, one has M a ^_ a (b p ) = 1 — Bi t i-. a M a> i(bp) . On 

the other hand, given W = 0, one has M aj i_ Q (6 p ) = Bi t i- a M at i(b p ) . Equiv- 
alently, a density function of M Qi i_ a (6 p ) is obtained via the distributional 
relationship 



fa,l-a,p(t) = (1 - a) / 
J 



Pfa,l,p( - ) +Pfa,l,p 



UJ \ U 



- 1 (l-«)- a dtt. 



6.2. The case o/PD(a,a). The important case of PD(a, a) is, in general, 
more challenging than the case of PD(a, 1 — a). Of course, these two agree in 
the case of a = 1/2 corresponding to quantities related to Brownian bridges. 
Technically, one can apply the formula based on A a ^ Q+ i , but this does not 
always yield very nice expressions. Alternatively, in the special case where 
1 — a = 2a, that is a = 1/3, one might think of using mixture representation 
results such as those given in Theorems 6.1 and 6.2. According to the latter, 
one can determine M a 1—0,(77) and then, by resorting to the former (with 
6 = a), one obtains M a a (rj). For example, one can use the Dirichlet process 
mixture representation to obtain the probability distribution of ^^2/3,2/3(77) 
from the distribution of M2 1 (77) . Additionally, when a > 1/2, one may use 

the density representation of Ma^aiv) based on A a ^ a , coupled with the 
mixture representation. Let us investigate these cases by considering specific 
examples. 

Example 6.2 [Probability distribution of P a , a (C)]- First, note that, 
having set p = 1 — p = 77(C) G (0, 1), the quantity A Qja has been described 
in (29). The expressions for A Qja+ i are the same for any a £ (0,1) since 
sin(2arctan(^|)) = sin(2arctan(^|) + 2vrIr Q (t)). Hence, for any a G (0, 1) 
and t G (0, 1), one has 

,0^ X (A _ 2 7a (t) 7a -i(t)Ca(t) ~ Ca-l(*)7a(*) +Ca-l(*K2(*) 

with 7 a (£) =p(l — t) a +pcos{a-K)t a and Ca(t) = psm(aTr)t a . These findings, 
with some simple algebra, lead to the following corollary. 

Corollary 6.2. The random probability P a>a (C) admits a density func- 
tion coinciding with 



y 1 

a— 1 



, . a»sin(a7r) , 

Xv)= / Mv-t)] 

vr Jo 



x (p z (l-t) 2a -\l + t) 
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(38) + 2ppt a+1 (l - t)*- 1 cos(avr) - p 2 t 2a ) 

x (\p 2 (l-t) 2a +p 2 t 2a 

+ 2ppt a (l - t) a cos(avr)] 2 ) -1 dt 

for any y in (0, 1). 

It is now worth noting that the above formula, with a = p = 1/2, yields the 
well-known result about the probability distribution of A = J I(o,+oo) O^s) ds, 
in the case where the Markov process Y = {Yf-.t G [0,1]} is a Brownian 
bridge. Indeed, Levy found that A is uniformly distributed on the interval 
(0, 1); see [28]. In this case, A 1/ / 2j 3/2(i) = 2ir~ 1 t~ 1 / 2 and the density function 

°f A/2,l/2(C) is g iven by 

iy 2A/ M = ^J o V t- 1/2 (y-tr 1/2 dt = i 

for any y G (0, 1). 

Example 6.3 (Uniform parameter measure). Let us again consider the 
case in which rj(dx) = Ir 01 \(dx). Recall that 7«(<) = (t a+1 cos(a7r) + (1 — 
t) a+1 )/(a + l) and ( a (t) = t a+1 sin(avr)/(a + 1). These yield 

~ . . ,„ . 9 sin(a7r) 
A a , a+1 (t) = (l + a) 2 v ' 

air 

t a [(l- t) 2a+l (l + t) - t 2a+2 + 2 cos(a7r)i Q + 2 (1 - t) a ] 
X [t 2a + 2 + (1 - t) 2a + 2 + 2 cos(a 7 r)^+ 1 (l - t) Q + 1 ] 2 ' 

The expression of the density q a>a somewhat simplifies when a = 1/2. In- 
deed, in this case, one has 



_ 9 ^ t [(l-t) 2 (l + t)-t 3 ] 
A V2,3/ 2 (*) ~ 2vr[l-3t(l-t)] 2 



for any t £ (0,1). In order to determine the probability density function q, 
compute 

r ' sKyj Wo [l-3t(l-t)] 2 



2 (2)n3^ f« _ 1/2r+n+1/2 _ +s dt 
vr ^ n\ Jo 

£ (-1)" j\y ~ tr l ' 2 f +n+k+1 l 2 dt 



n>0 ' fc=0 
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2 v (2) n 3" ^ fn + s\ k n+k+T+l V{r + n + k + 3/2) 
t V & / r(r + n + fc + 2) ' 



where (a) n = T(a + n)/r(a) for any a > and n > 0. Consequently, 

?i/2,i/2(y) = Ay - 0" 1/2 Ai A 3/ 2 (i) * = hM + /i, 2 (y) - h, (v) 



for any y in (0, 1). 

An alternative representation of this density can be achieved by resorting 

to Theorem 6.1. Indeed, one has that Af 1 / 2 ,i/2( r /) = ^1,1/2-^1/2,1 (v) + (1 — 
Bi ij2)Y , where the density function of M1/2, is given by 

9 y 3/2Q _ y )3/2 

?i/ 2| i(y) = A 1/2|1 (y) = ^ {y3 + (1 _ y)3} 2 II (o,i)^) 

and Y is uniformly distributed over the interval (0,1). This then suggests 
that a density of M 1 / 2 ,i/2(v) can be represented as 

21/2,1/2(2/) = ^/ ( X l ~ X3 ^ 1/2 {f. ( X 2 -^3)" 1/2 9l/2,l(^2)dx 2 |d2;3 



2 Jo 

1 Z" 1 

+ 2 



^(x 3 -xi) 1/2 |^ (2:3-3:2) 1/2 Qi/2,i(x 2 )dx2^ dx 3 



9 ^_ /- 1 x 2 (l-x 2 ) 3 / 2 (1 3 iA 

= 2^^ {x3 + (l-, 2 )3 }2 ^ U' 15 2 ; ^ 

9 . f*i x 2 /2 (l-x 2 ) „ /l 3 1-xi. 

+ 2^^U {x3 + (l-x 2 )3 }2 2Fl U' 1; 2 ; r^'^ 2 ' 

where 2 iq is the Gauss hypergeometric function. 



7. Perfect sampling M a ^g(r)). Our results thus far have provided quite 
a few expressions for the densities and c.d.f.'s of M a ^(rj) which are cer- 
tainly interesting from an analytic viewpoint. However, it is clear that if 
one were interested in drawing random samples, it is not always obvious 
how to do so. The clear exception for all r\ is the M a ${r\) case, where one 
can apply straightforward rejection sampling based on the explicit density 
in Theorem 4.1. Here, we show that this fact, in conjunction with the cor- 
respondence to the Dirichlet process established in Theorems 2.1 and 6.1, 
allows us to perfectly sample random variables M a $(r]) for all < a < 1 
and 9 > 0. One simply needs to apply the perfect sampling procedure for 
Dirichlet mean functionals devised by [15]; see also [23] for an application 
of this idea to a class of non-Gaussian Ornstein-Uhlenbeck models arising 
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in financial econometrics. First, recall that Theorem 6.1 establishes the dis- 
tributional identity 

M a , e (v) = M Q , e (Q a>0 ) = M , e (Q a ,o)Be,i + (1 - B e ,i)M a , ( v ). 

Recognizing this, we now recount the basic elements of the perfect sam- 
pling algorithm of [15], tailored to the present situation. First, note that 
perfect sampling can be achieved if < a < M at g(r]) < b < oo almost surely. 
Furthermore, note that this is true if and only if the support of Q a is 
[a, b] or, equivalently, M a fi{rj) E [a, b]. Now, as explained in [15], following 
the procedure of [39], one can design an upper and lower dominating chain 
starting at some time — N in the past up to time 0. The upper chain, say 
uM a: 0(r]), is started at uM a ^_^{rj) = b and the lower chain, iM^^), is 
started at lM a fi-^{rj) = a. One runs the Markov chains for each n based 
on the equations 

uM a> tn+1 (r]) = B nfi X n + (1 - B n ^)uM at e,n(v) 

and 

lM aA n+i(v) = B n>e X n + (1 - B nfi )lM afi , n {ri), 

where the chains are coupled using the same random independent pairs 
(B nj g,X n ), where for each n, B n g has a Beta(l,#) distribution and X n has 

distribution F a ^. That is, X n = M Q , ! o(ry). The chains are said to coalesce 
when D = \uM a fi^ n (rj) — lM a ^^ n {rf)\ < e for some small e. Notice, impor- 
tantly, that this method only requires the ability to sample M Qj o(??), which 
is provided by Theorem 4.1, and an independent Beta random variable. 

APPENDIX A: PROOF OF THEOREM 3.1 

A.l. An inversion formula for the Cauchy Stieltjes transform. In order 
to determine the density function, q a g , of the random mean P a fi(f), one can 

invert the transform Sg[z; P a fi(f)}. Indeed, since q aj g is a density function, 
with respect to the Lebesgue measure, on M + , one has J Q C y~ p q Q fi{y) dy < oo 
for some p > for any c > 0. The inversion formula provided in [42] can be 
applied to obtain 

Q 

(39) q ( y ) = -^[ (l + w ) 6 - l S [yw;P a>e (f)}dw. 

Z7T1 JyV 

In the previous formula, W is a contour in the complex plane starting at 
w = — 1, encircling (in the counterclockwise sense) the origin and ending 
at w = -1, while S' g [yw; P a ,e{f)] = ^S e [z\P a fi{f)]\z=yw li > I, one can 
integrate by parts, thus obtaining 

(40) q (y) = 6 —±y^ j (l +w ) e ^Se[yw;P aie (f)]dw, 
see (18) and (19) in [42]. 
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A. 2. Proof of Theorem 3.1. In order to obtain (19) from the above rep- 
resentation (39), first note that the complex integral can be rewritten as fol- 
lows. Set, for any y in the convex hull of the support of H o / , T y C C to 
be the path starting at w = — y, encircling the origin and ending at w = —y. 
Hence, a change of variable in (39) yields 



,(v) 



i 

27ri Jr 



(y + W y- L S' e [w;P a , 9 (f)]dw. 



For simplicity, introduce the function w i— > h(w) = (y + w) e 1 S' [w; P a ,e(f)] 
and note that S' e [w; P a ,e(f)] = ^e+i[w,P a ,e(f)]^ as described in (16). By 
virtue of Cauchy's theorem, one has 



h(w) dw 



-y-ie rO 

h(w) dw + / h(x — ie) dx 

y+iO J-y 



+ ie 



t/2 . . p—y 

e ls h(ee ls )ds + / h(x + ie) dx + 

-7T/2 J o 



/i(w) dw. 



A few straightforward simplifications lead to 



h(w) dw 



(41) 



o 



+ ie 



[h(—x — ie) — h(—x + ie)] dx 



+ 



tt/2 

-7T/2 

-y+iO 
y+ie 



e ls h(ee is )ds + 
h(w) dw. 



y+iO 



h(w) dw 



In order to determine the behavior, as e J. 0, of the last two summands in (41), 
let us show that the function s i— > h(—y + is) is integrable in a neighborhood 
of the origin. Indeed, one has 



\h(-y + is)\ = \s 



\{f x (-y + is + f(x))<*H(dx)}°/«+i\- 



As for the numerator, one has | Jx(~y + 1S + f( x )) a ~ H{dx)\ < J R+ | — y + 
x + is\ a ~ 1 rj(dx) < J K+ \x — y\ a ~ l r](dx) and this is finite for any y in Q a)V . On 
the other hand, if one sets g±(x,y;s) := \x — y + is| a cos(aarg(a; — y + is)) 
and 52 (x, y; s) := \x — y + is| Q sin(aarg(x — y + is)), the denominator would 
coincide with 



(x — y — \s) a r)(dx) 



9/a+l 



exp|(6»a 1 + l)log(J^[g 1 (x,y;s) + ig 2 (x,y;s)]r)(dx] 
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\ 2 / r- \ 6»(2o)- 1 +2- 1 

9i(x,y,s)rj(dx) ) +(/ g 2 (x,y;s)r](dx) ) 
J \JR+ J 



> 



g 2 (x,y;s)rj(dx] 



We now confine ourselves to the case where s is in (0,£) for some positive 
£. The same reasoning can be applied in the opposite case when s S (— £, 0). 
Since arg(rr — y + is) = arctan(s/(x — y)) +7rl( 0) y) (x), this quantity is a value 
in (7r/2,7r) and, hence, sin(aarg(x — y + is)) > 0. Consequently, if we set 
w a = a[7rI(o 2/3] ( a ) + ( 7r /2)I(2/3 l) ( a )], the denominator is bounded below 
by 

y| a sin( a arc tan - 



y 



> < sm(w a ) / \x — y\ a 7](dx) 
J(o,y] 



6a- 1 +1 



and the latter must be greater than some positive constant, say K y , for any 
y in @ a ,r)- Finally, we have \h(—y + is)\ < K' y s 1 for any s in (0,£). This 
implies that, when we let e j 0, the last two terms in the right-hand side of 

(41) tend to zero. As far as the second summand in (41) is concerned, one 
can also show that it tends to zero since 

(42) lime* e is (y + ee is ) e " 1 e 1 " Q cS m [e e is ; P a , e (f)] ds = 0. 

In order to show this, observe that, for any real number s such that \s\ < vr/2, 
one has \(y + ee^ 13 )] 9 " 1 = {y 2 + e 2 + 2yecos{s)Y e ~^/ 2 < y " 1 , when 9< 1. 
Moreover, one can determine a bound for 

|/ B+ [ee* + x] a -ydx)| 



(43) 



.l-Q 



So+i[ 



(f)}\ 



,1-Q 



|{/ R+ [ee i * + x] a »7(d a ;)} 9 /'H- i r 

As for the numerator in (43), define g £ (x,s) := x 2 + e 2 + 2xecos(s) and 
note that | J R+ [ee ls + x]"" 1 ??^)! < J R+ [&(z, s)]^" 1 )/ 2 < e"" 1 since a < 1 
and, as before, |s| < 7r/2. On the other hand, if we further set d e (x,s) := 
arctan(esin(s)/[x + ecos(s)]), then 



[ee is + x] a T 1 (dx) 



> 



[g £ (x, s)] a/2 cos(ad £ (x, s))rj(dx) 



For any s G (— §, f ) and cc > 0, it can be seen that cos(ad e (x, s)) > cos(a7r/2) 
and g e (x,s) > x 2 . These, in turn, yield the following lower bound for the 
denominator in (43): 



[ee is + x^rjidx] 



d/a+l 



x a 7](dx) 



6/a+l 



K n a > 0. 
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Hence, the expression in (43) is, for any e > and (x, s) G M + x (—§,§), 
bounded by some constant depending only on a and 0. This implies (42) 
and then 



h(w) dw = lim / [h(—x — ie) — h(—x + ie)] dx. 

In order to interchange the limit with the integral, we now show that for any 
y, there exists B y C (0, y), with \(By) = 0, such that for any x G .By, one has 
|/i(— x — \e)\ < /i(x) and h is integrable on (0,y). The same argument can be 
applied to \h(—x + \e)\. If we set 

(44) 7 £Q (x):= / \(t - xf + e 2 ] a/2 cos (a arc tan + a7rI/ ^ (t) ) r](dt), 

JR+ \ t-X V ' ; / 

(45) Ce,a(x) ■= / [(t-x) 2 + e 2 ] a/2 sin(aarctan — \- anI(p jX )(t) )v{dt), 

then 

„_1 [TeVl^+CVl^] 172 



|/i(— x — ie)| = |y — x — ie| 



[7 £ 2 a(^) + C £ 2 a (^)] (0+2a)/(2Q) 



Now, note that [7 2 Q _i(x) + C 2 Q -i( 3; )] 1 ^ 2 < ^ Jr+ I s _ t| a-1 r7(dt), for some 
constant > 0. Furthermore, for any t G [x, +oo), one has aarctan(e/(i — 
x)) G (0,O!7r/2] and for any t G (0,x) it follows that a7r + aarctan(e/(t — x)) G 
(an/2, air). Hence, if a G (0, 1/2], then 7 2 a (x) > M > for any x. On the 
other hand, if a G (1/2,1), then sin(aarctane/(t — x)) < sin(arctane/(i — 
x)) = e/ ^/e 2 + (t — x) 2 for any t > x and 

Ce a{%) = / It — x — ie| a sin( aarctan l^^t) 

J[x,+oo) V t-xj 

— / It — x — ie| a sin( jot + a arc tan )ri(dt) 

J{0,x) \ t-X J 

< e I [{t- x) 2 + e 2 ] (a ~ 1)/2 r](dt) - sm(c a ) f (x - t) a r, (dt) 

J[x,+oo) J(0,x) 



<e a - sm(c a ) ( (x - t) a V (dt) 

J(0,x) 



(0,x) 

where c a = argmin a7r / 2 <c<o7r sin(c). Since x G C v , there exists e* > such 
that, for any e G (0, £*), one has Ce,a(%) < M a < 0. This implies that \h(— x — 
ie)| < M'[y 2 + M"]^- 1 ^ 2 J R+ \x - t| Q_1 ?7(dt) for some suitable positive con- 
stants M' and M" . The proof is then complete if we can show that x i— > 
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J R+ \x — t| a ~ 1 r/(<it) is integrable on (0,y). To this end, note that 

x - t| Q_1 77(dt)dx 



JR+\{x} 



\t-x) a ~ 1 dx + j V {x-t) a - 1 dx\r } {dt) 



(0,2/) U0 Jt 

y 

\a—l 



{t-x) a - l dxr]{dt) 

'(y,+oo) ^0 

and this turns out to be finite since / G £ a {H) yielding / t a r\ (dt) < oo. 

APPENDIX B: PROOFS FOR SECTION 4 

We now prove the main results stated in Section 4 concerning the determi- 
nation of the probability distribution of the mean of a normalized a-stable 
subordinator. 

B.l. Proof of Theorem 4.1. The first thing to note is that 

J[z + x] a ~ 1 7](dx) 



(46) Si[z;M Qfi (ri)] 



J[z + x] a r](dx) 



for any z such that | arg(z)| < ir. In order to evaluate the density q a ,r], one 
can invert (46) by means of the Perron-Stieltjes formula, which yields 

q a ,o(y) = ^-rlim{cSi[-y-ie;M Qi0 (r?)] - Si [-y + ie; M afi (77)]} 

Z7T1 elO 

and it can be seen that the above reduces to 

f\ 1 V T fCf • tjf ( Ml 1 T T 7e,a-l(y) -iCe,a-l(£/) 

9a,o(y) = - hmlmjcSi -y - le; M a (r?) } = - hmlm — — - — — — . 

The assumptions J R+ x a ry(dx) < 00 and y £ Qjr) allow a straightforward ap- 
plication of the dominated convergence theorem. This leads to lim £ | le,a{y) = 
la{y) and lim £l0 Ce, a {y) =Ca(y) for any y, while lim £ |o7e,a-i(y) =7a-i(y) 
and lim £ |o Ce,a-i(y) = Ca-i(y) fo r an Y 2/ S Q ,?y The result then easily fol- 
lows. 

B.2. Proof of Theorem 4.2. From Theorem 4.1, it is known that a density 
function q a ^ for J R+ xP Q , j o(dx) is of the form 

ga , (y) = A a ,i(y) = All{y) + a{y)} 
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for any y G ® a , v - Let us now compute the derivatives of 7 a and ( a . In order 
to do so, note that 

lim ^°' y+h) ^ V + h ~ x ) a ^ dx ) ~ /(o,y) (y ~ x) a y(dx) 
HQ h 

f (y + h-x) a -(y-x) a 

= hm / : max) 

HoJ( ,y) h 

— lim f — — ^—r](dx). 

HO J[y,y+h) h 

Since we are confining ourselves to evaluating the density on the set of points 
y in Q a ,r), the probability measure rj does not have a positive mass on such 
y's. Hence, for suitable positive constants c y and k y , one has 



(y + h-x) a ^^ _ , lTr , _ , a 



[3/,y+/i) ^ 



- V (dx) = cyh a - l {*(y + h)- < fc y /i c 



where the first equality follows from the mean value theorem for Riemann- 
Stieltjes integrals and the inequality is a consequence of the fact that is 
Lipschitz of order 1 at y. On the other hand, {(y + h — x) a — (y — x) a }/h < 
a{y — x) a ~ l for any x G (0, y) and for any h > 0. Since x h- >• (y — x) a_1 is inte- 
grable on (0,y) for any y G 8 a ,??, one can apply the dominated convergence 
theorem to obtain (d/dy) fr 0y \(y — x) a rj(dx) = a Lo y \(y — x) a ~ 1 r](dx). The 
same argument can be applied to prove that (d/dy) j, y +00 )(x — y) a rj(dx) = 
- a J( y ,+oo)( x ~ V) a ~ lr l{dx)- These imply that ( a -i(y) = -a^C^y) and 
7a-i(y) = -°<~ lr Ya(y)- Consequently, 

Qa,o(y) = — -y- arc tan , 
omdy j a {y) 

from which one easily obtains the expression of the c.d.f. x i— ► Jq q a ,o(y) dy 
displayed in the statement of the theorem. 



APPENDIX C: PROOFS FOR SECTION 5 

C.l. Proof of Theorem 5.1. From the definition of A a g and the repre- 
sentation of the generalized Stieltjes transform of M a; g(rj), as given in [44] 
and [45], it is apparent that 

A a ,e (*) = - lim lmS e [-t-ie;M a 6 (??)] 

7T ej,0 

Iff 1 ~ 8/a 

= — limlm< / (— t — ie + x) a ri(dx) > , 
vr eio Ux J 
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where X C M + . One has 

y^(-t-i£ + x) a r](dx)} ' =exp|-^log( 7£ia (t)-iC e , a (t))|. 

Let us first confine our attention to the case in which a is in the interval 
(0, 1/2]. Since aarctan(^3j) + a-Kl^^(x) E (0,a7r), for any t and x, one has 
Ce,a(t) > an d le,a{t) > 0. Consequently, 

exp|-^log(7 £iQ (t) -i£ £ , Q (X))j 

= ilUt) + CUt)r 6/(2a) expji- arctan 

Note that the absolute values of each of the two integrands defining 7 £jQ , and 
C, £ .a are bounded by \x — t\ a + K, which is integrable with respect to rj. We 
can then apply a dominated convergence argument to obtain 

lim7 ea (t) =7a(t), limCe ja (t) = Ca(*) 

for any t > 0. This implies (30) after noting that, in this case, T a = 0. 

On the other hand, when a E (1/2, 1), one needs to consider the set r £jQ , := 
{t E M + : 7 £)C( (t) > 0} and note that T^ a n (0, y) is nonempty for some values 
of y in C„. This yields a slightly different form for the arguments of the 
complex numbers involved in the definition of A a< g. One can easily mimic 
the line of reasoning employed for the case a E (0, 1/2] so as to again obtain 
(30). 

C.2. Proof of Theorem 5.2. Since M a)8 (rj) = M 0j8 (Q a ), it follows that 
the distribution functions given in (10) and (24) are equal for all 9 > 0. 
Statement (i) then follows by the uniqueness properties of the integral rep- 
resentations. The first identity in statement (ii) follows immediately by set- 
ting 6 = a in statement (i) which, noting that < a < 1, uses the strict 
positivity sm(n aQ (t)) for Q afi (t) > 0. If one now exploits (i) with 9 = a, 
it is possible to obtain 

A a , Q (t)vr I 1 /" 



e -R a (t) 



sm(iraQ afi (t)) 



We now set 9 = a in (32) to obtain the second identity involved in (ii), thus 
completing the proof. 

C.3. Proof of Theorem 5.3. By definition, 

A a g+ i(t) = -lim{S e+ i[-t - ie; M a e (rj)] - S e+1 [-t + ie; M a B (??)]}, 

7T ej.0 
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which can be seen to imply 

A a e+i (t) = - hm Im— : —!— Jl—J. . 

For any e > 0, |(— t — ie + x) a \ can be bounded by an integrable function 
with respect to rj, not depending on e in a similar fashion as in the proof of 
Theorem 4.1. On the other hand, |(— t — ie + x) a ~ 1 \ < \x — t\ a ~ l +K', for any 
e > 0, x and t. If we further set t £ Qjr) , then x \— ► \x — t\ a ~ l is integrable 
with respect to r\ and the dominated convergence theorem can be applied. 
The expression in (33) easily follows. 

C.4. Proof of Proposition 5.1. Statement (i) concerns the evaluation of 
E 7? [u;™ K (t) sin(7rm/c77 m (t))]. Here, we use the fact that if c mK (t) < oo, there 
exists, by a change of measure, a density for each which is proportional to 
\t — y\~ mK 7](dy). It then follows that, with respect to this i.i.d. law for (Yfc), 
mVm(t) is a Binomial (mp mK (t)) random variable and the result is proved. 
Statement (ii) is derived from (8) using a conditioning argument. Similarly, 
statement (iii) follows from (10), noting that the jumps of 6r] m are less than 
1. 

APPENDIX D: PROOF OF THEOREM 6.1 

The proof follows as a direct consequence of the mixture representation 
of the laws of the P a ^ random probability measures deduced from their 
posterior distributions. Specifically, one can immediately deduce from [33], 
with n = 1, that, 

Pa,e(') = Be+ a ,l-aP a ,e+a(-) + (1 - Be+a,l-a)5Y{-), 

yielding the stated result. Specifically, apply the above identity to P aj e(g), 
where g(x) = x. Naturally, this statement is an extension of the result 
deduced from Ferguson's characterization of a posterior distribution of a 
Dirichlet process; see [10, 11], also related discussions about mixture rep- 
resentations derived from posterior distributions in [21, 22]. As for the 
proof of (ii), this follows from the distributional identity stated in Theo- 
rem 2.1 and by setting Y = M afi (rj) in (i). Indeed, M a>g (r)) = M 0id (Q a ) = 
Be,iM 0i g(Q a ) + (1 - B ejl )M at o(r)). The proof is complete by again applying 
Theorem 2.1 to the first summand of the last sum in the previous chain of 
identities. 
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